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Abstract 

Background: Genetic alterations could be responsible lung cancer, the leading cause of worldwide cancer death. 

Methods: This study investigated gene mutations in a Han Chinese fannily of lung cancer using the whole genome 
exome sequencing and subsequent Sanger sequencing validation and then confirmed alteration of prominin 1 
{PROMl) and cyclic AMP-response element binding protein-regulated transcription co-activator2 {CRTC2) in blood 
samples of 343 sporadic lung cancer patients vs. 280 healthy controls as well as in 200 pairs of lung cancer and the 
corresponding normal tissues using PCR-restriction fragment length polymorphism and directed DNA sequencing 
of PCR products. 

Results: The data showed PROMl (p. S281 R) and CRTC2 (p. R379C) mutations, in 5 and 2 cases of these 343 sporadic 
lung cancer patients, respectively. Notably, these mutations were absent in the healthy controls. Furthermore, in the 
200 lung cancer and the matched normal tissues, PROMl mutation occurred in 3 patients (i.e., one squamous cell 
carcinoma and two adenocarcinomas) with a mutation frequency of 1.5%, while CRTC2 mutation occurred in 5 patients 
(two squamous cell carcinomas and three adenocarcinomas) with a mutation frequency of 2.5%. 

Conclusions: The data from the current study demonstrated novel PROMl and CRTC2 mutations, which could promote 
lung cancer development. 
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Background 

Lung cancer is a significant worldwide health problem, 
accounting for 13% (1.6 million) of the total cancer cases 
and 18% (1.4 million) cancer deaths [1]. In China, data 
from the most recently retrospective sampling survey 
showed that the rate of lung cancer death rate was 
30.83/10^ (41.34/10^ for men and 19.84/10^ for women) 
[2]. The overall five year survival rate of lung cancer re- 
mains approximately 15% worldwide, driven by the ma- 
jority of cases diagnosed at advanced stages, resulting in 
non-resectable lesions [1]. Tobacco smoking is the pre- 
dominant risk factor for both small cell lung cancer and 
non-small cell lung cancer (NSCLC) [3]. However, to 
date, the defined pathogenesis of lung cancer remains to 
be determined because only a small fraction of tobacco 
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smokers have developed lung cancer, but a certain percent- 
age of lung cancer patients are never smokers, indicating 
that there is individual variation in cancer susceptibility in 
the general population and there are other factors, such as 
genetic factors or host contribution to a predisposition of 
lung cancer development [4]. Another risk factor in lung 
cancer development is familial aggregation and history, 
which associated with 73% increase (95% confidence inter- 
val [95%CI]: 50%- 100%) in lung cancer risk in women and 
71% increase (95%CI: 49-96%) in men [5]. Individuals with 
the first-degree relative lung cancer had a 1.51-fold in- 
crease in lung cancer risk compared to individuals without 
a family history (95%CI: 1.39-1.63) [6]. Thus, lung cancer 
risk models using epidemiologic data have been developed 
and the most parsimonious models for both ever and never 
smokers include a family history as a lung cancer risk [7]. 

Indeed, lung cancer can be aggregated in the family 
and inherited and appears to be the result of an interaction 
of multiple genes and environmental factors. Thus, search 
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for a gene or genes associated with susceptibility is urgently 
needed. Previous studies showed that several tumor sup- 
pressor genes were inactivated in lung cancer patients, 
including TPS3 [8] RBI [9,10], and PTEN [11], while infre- 
quent activating mutations or amplifications of PIK3CA, 
EGFR and KRAS, and MYC did occur in certain lung can- 
cer patients [12]. Recent genome- wide association studies 
(GWAS) identified some loci that associated with lung 
cancer development [13,14]. Chromosomal region 15q24- 
25.1, containing nicotinic acetylcholine receptor sub-unit 
genes, has been associated with increased risk of lung can- 
cer in ever smokers [13,14]. Linkage analysis in families 
with aggregation of lung cancer showed a region on 
chromosome 6q23-25 associated with risk of lung cancer 
[15]. These data plus data among individuals with a family 
history of lung cancer indicate that the relative risk of lung 
cancer associated with markers in this region is much 
higher in familial cases compared to the relative risk ob- 
served among sporadic cases [16,17]. 

Thus, in this study, we detected gene mutations in a 
Han Chinese family of lung cancer using the whole 
genome exome sequencing [18] and subsequent Sanger 
sequencing validation and then confirmed these gene alter- 
ations in blood samples of 343sporadic lung cancer 
patients vs. 280 healthy controls as well as in 200 pairs of 
lung cancer and the corresponding normal tissues using 
PCR-restriction fragment length polymorphism and di- 
rected DNA sequencing of PCR products. 

Results 

Detection of PR0M1 T/G and CRTC2 G/A mutations in 
members of lung cancer family using whole genome 
Exome sequencing 

In this study, we performed whole genome Exome 
sequencing on genomic DNA samples of the four af- 
fected and unaffected relatives in this lung cancer family 
(Table 1). To identif)^ potential genetic variants associated 
with lung cancer development, we generated an average of 
4.9 Gb of DNA sequence with 50X average coverage per 
individual as single-end, 90-bp reads and the fraction of 
effective bases on target was about 50% with a minimum 
54-fold of average sequencing depth on the target. At this 
depth of coverage, more than 97% of the targeted bases 
were sufficiently covered to pass the thresholds of variant 
calling. We then compared the variants with the Han 
Chinese Beijing SNPs from dbSNP132 and the 1000 
Genome Project. This generated a total of 57 genetic 
variants (including 55 non-synonymous SNPs and 2 splice) 
that were shared by the two patients, but absent in the 
two healthy members, which were predicted to potentially 
have a functional impact on the gene expression. After 
that, we directly sequenced PCR-products to validate these 
57 variants and obtained accuracy of 67% (38/57) for those 
variants. We found that mutations of PROM 1 and CRTC2 



in two lung cancer patients of family cases, but absent in 
the two healthy members (Table 1). 

Validation of genetic variants 

To confirm the 38 validated genetic variants, we per- 
formed PCR-restriction fragment length polymorphism 
(PCR-RFLP) analysis of blood samples from 343 unrelated 
sporadic lung cancer patients and 280 healthy controls 
(Table 2). This cohort of 343 lung cancer patients had a 
mean age of 62.04 years old (ranged between 30 and 84 
years) with 63.56% of men and 36.44% women, whereas the 
280 control individuals had a mean age of 59.98years old 
(ranged between 35 and 80 years) with 69.64% men and 
30.64% women. Age and gender between cases and con- 
trols were well balanced. We found mutations in PROMl 
and CRTC2 in these two family NSCLC cases (Figure 1 and 
Table 1). PROMl mutation was a T to G change, resulting 
in an S281R amino acid change. The PROMl mutation was 
validated in 5 samples, including 3 adenocarcinomas 
(ADC) and 2 squamous carcinomas (SCC), with mutation 
rate of 1.4% (Table 3). Moreover, CRTC2 mutation was vali- 
dated with a G to A change, resulting in an R379C amino 
acid change (Table 3) in two lung cancer samples, including 
two ADCs, with mutation rate of 0.5%. However, both gene 
mutations were absent in these 280 controls. 

In addition, we further verified PROMl and CRTC2 
mutations in an additional 200 pairs of lung cancer and 
the matched normal tissue specimens (Table 4). We 
found PROMl mutation in 3 pairs of lung cancers and nor- 
mal tissues, including 1 SCC and 2 ADCs with mutation 
frequency being 1.5% and CRTC2 mutation in 5 pairs of 
lung cancers and normal tissues, including 3 ADCs and 2 
SCCs with mutation frequency being 2.5%. All mutations 
were further confirmed by direct DNA sequencing of PCR 
products (Figure 1 and Table 3). 

Discussion 

In the current study, we sequenced and compared the 
whole genome coding regions of genes in two lung can- 
cer patients and two healthy family members and then 
filtered the benign changes using public databases, in- 
cluding the 1000 Genome Project and dbSNP132. Use of 
the second-generation sequencing technique produced a 
high level of coverage with higher accuracy and allows 
more regions of a genome to be sequenced in very cost 
effective manner. We successfully identified PROMl and 
CRTC2 mutations. Our data are novel and mutation of 
these two genes had not been previously reported in 
lung cancer. After that, we confirmed our data in blood 
samples of 343 lung cancer patients, but not in 280 
healthy controls. These two gene mutations were not re- 
ported in the 1000 Genomes data and dbSNP132. 

PROMl is localized at chromosome 4pl5.32 and con- 
tains 27 exons to code CD 133 protein. To date, the 



Table 1 Clinicopathological features of the lung cancer family with PR0M1 and CRTC2 mutations 



Patient 
ID 


Gender 


Age 
(years) 


Alcohol 
consumption 


Tobacco 
smoking 


Pack 
years 


Medical history 


Histological 
type 


TNM (Stage) 


Tumor differentiation 


PROMl mutation 


CRTC2 mutation 






















Nucleotide 
change 


Amino acid 
change 


Nucleotide 
change 


Amino acid 
change 


114 


Male 


65 


Frequent 


Former 


1095 


COPD, bronchiectasis 


Adenocarcinoma 


TlaNOMO 


Moderate 


841 T>G 


S281R 


1135G> A 


R379C 


1111 


Male 


51 


Frequent 


Former 


365 


None 


Adenocarcinoma 


TlbNOMO 


Poor-moderate 


841 T>G 


S281R 


1135G> A 


R379C 


1114 


Female 


37 


Never 


Never 




None 


N/A 


N/A 


N/A 


None 


None 


None 


None 


IVl 


Male 


10 


Never 


Never 




None 


N/A 


N/A 


N/A 


None 


None 


None 


None 
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Table 2 Clinicopathological features of 343 sporadic 
NSCLC cases and 280 healthy controls 



Lung cancer Control 



Characteristics 


N(%) 


N(%) 


Age in years: median 


62.04 ± 10.86 


59.98 ± 9.65 


Gender 






Female 


125(36.44) 


85(30.36) 


Male 


218(63.56) 


195(69.64) 


Smoking status 






Never 


68(19.82) 


N/A 


Ever 


107(31.20) 


N/A 


N/A 


168(48.98) 


N/A 


Pack years (median, range) 


432(20-732) 


N/A 


Histological types 






Adenocarcinoma 


120(34.98) 




Squamous cell carcinoma 


204(59.48) 




Adenosquamous carcinoma 


8(2.33%) 




Others 


11(3.21)' 




Clinical stage^ 
1 

II 
III 


76(22.89) 
58(17.47) 
91(27.41) 




IV 


107(32.23) 




Tumor differentiation^ 






Poor 


204(61.44) 




Moderate and poor-moderate 


121(36.45) 




Well and moderate-well 


7(2.11) 




Lymph node metastasis 






Negative 


154(44.90) 




Positive 


189(55.10) 





^ including: 11 small cell lung cancer; "^for non-small cell lung cancer. 



precise functions of CD 133 protein are unknown al- 
though it is a member of five-transmembrane glycopro- 
teins. The latter proteins specifically localize to plasma 
membrane protrusions. CD 133 protein has two short 
N (extracellular)- and C (cytoplasmic) -terminal tails, 
and two large N-glycosylated extracellular loops (be- 
tween TM2 and -3,and TM4 and -5). PROMl can 
translate into 7 isoforms of CD 133 protein through the 
alternative splicing in human tissues and the alterna- 
tively spliced exons in the coding region only affect the 
short N- and the C-terminal domains [19,20]. CD133 
is expressed in various normal and stem cells; thus, it 
was classified as a marker of primitive hematopoietic and 
neural stem cells. Recently, rapidly accumulated evidence 
indicates that CD 133 had been described as the most im- 
portant marker inherent to a number of types of cancer 
stem cells (CSCs) [21-24]. These CSCs can initiate the 



tumors with performing unique functions, such as asym- 
metric division, self-renewal, drug resistance and quies- 
cence [25]. A previous study reported that a rare 
population of tumorigenic cells in small cell and non- 
small cell lung cancer expressed CD133 protein [26]. 
Lung cancer contained a rare population of CD 133"^ 
CSCs able to self-renew and generated an unlimited pro- 
geny of non-tumorigenic cells [27]. Cui et al investigated 
several human lung cancer cell lines and found that 
CD 133 was a marker for the small cell lung cancer and 
had stem cell-Uke features, such as self-renewal, dif- 
ferentiation, proliferation and tumorigenic capacity 
[25]. PROMl mutations, including p.R373C, p.Q576X, 
p.G614fsX626, and p.Y452fsX12, were associated with 
a heterogeneous group of inherited retinal disorders with 
autosomal recessive and dominant inheritance patterns 
[26,28] . In our current study, we identified somatic PROMl 
mutation (p.S281R) in 8/550 lung cancer patients and this 
novel PROMl mutation was a T to G change, resulting in 
an S281R amino acid change. All of the 8 patients had 
non-small cell lung cancer (i.e., 5 adenocarcinomas and 3 
SCCs), five of which developed lung cancer at the age 
younger than 60 years old and two of which were classified 
as stage IV disease with poor differentiation. The mutation 
of PROMl is novel in lung cancer and further study will 
investigate the role of CD 133 protein in lung cancer devel- 
opment and progression. 

Furthermore, CRTC2 is localized at chromosome 
lq21.3 and codes a 693 amino acid protein, namely 
CRTC2 [29]. CRTC2 protein contains an N-terminal 
GREB-binding domain (CBD), a central regulatory (REG) 
domain, a splicing domain (SD), and a C-terminal domain 
(TAD). CRTCs have shown to induce expression of cyclic 
AMP-responsive genes [30]. CRTCs are sequestered in the 
cytoplasm through phosphorylation-dependent interac- 
tions with 14-3-3 proteins. In the absence of AMPK activ- 
ity, CRTC2 is dephosphorylated and translocated into the 
nucleus where it associated with CREB and induces ex- 
pression of the target genes [31]. It has reported that 
CRTC2 was able to regulate gluconeogenesis by integrat- 
ing calcium and metabolic signaling through calcineurin 
and AMPK/SIK kinase families [32]. Another study identi- 
fied LKBl as an essential activator for the AMPK gene 
family and a key regulator for CRTC2 transcriptional ac- 
tivity [33], while Shaw et al demonstrated that SIKI may 
participate in mediating LKBl tumor suppressor activity 
[34]. Ji et al found that LKBl mutations are detected 
in human tumor samples, including lung cancer [35]. 
Komiya et al found enhanced activity of CRTCl in LKBl- 
null lung cancer [36]. Brown et al reported CRTC2 play 
an important role in breast cancer of postmenopausal 
women [37]. In our current study, we found somatic 
CRTC2 mutation in 7/550 lung cancer patients. This 
novel mutation site could change G to A and result 
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Figure 1 Somatic gene mutations in III-1 patient. A, P/?0/W/ mutation. B, CRTC2 mutation. Tumor genomic DNA of tine affected family, 
genomic DNA of blood sample from lung cancer patient, genomic DNA from lung cancer and the matched normal tissues. 



Table 3 Clinicopathological features of patients with PROMI and CRTC2 mutations 



Patient 


Gender 


Age (yrs.) 


Type 


T 


N 


M 


Stage 


Differentiation 


Tumor size (cm) 


Family history 


Smolcing status 


Paclc years 


















PROMI 










LC^-15 


M 


58 


ADC 


la 


0 


0 


lA 


Poor 


1 .2 X 1 X 0.8 


No 


Never 




LC-116 


M 


70 


sec 


4 


0 


0 


IIIA 


Poor-moderate 


13.5X 12x6 


Yes 


Current 


182 


LC-122 


F 


59 


ADC 


2a 


1 


lb 


IV 


Poor 


4.3 X 2.3 X 3 


No 


Never 




LC-215 


M 


68 


sec 


2b 


1 


0 


IIB 


Poor 


6.1 X 4.9x5 


No 


Current 


546 


LC-275 


F 


54 


ADC 


4 


2 


la 


IV 


Poor 


17x11 X4.1 


No 


Never 




LCT^-17 


M 


58 


ADC 


2a 


1 


0 


IIA 


Moderate 


4x3.5x3.5 


No 


Current 


182 


LCT-51 


M 


61 


sec 


la 


0 


0 


lA 


Poor 


2 X 1 .5 X 1 .4 


No 


Current 


364 


LCT-181 


F 


56 


ADC 


2a 


1 


0 


IIA 


Moderate-well 
CRTC2 


4x3x3 


No 


Never 




LC-207 


M 


72 


ADC 


2a 


0 


0 


IB 


Moderate 


2.3 X 3.4 X 2 


No 


Current 


365 


LC-250 


F 


65 


ADC 


2a 


1 


0 


IIA 


Poor-moderate 


4x4x3.5 


No 


Never 




LCT-81 


F 


57 


ADC 


2a 


0 


0 


IB 


Poor-moderate 


3 X 2.5 X 2.5 


No 


Never 




LCT-94 


M 


69 


sec 


lb 


0 


0 


lA 


Poor 


2.5 X 1 .5 X 1 .0 


No 


Current 


500 


LCT-1 1 1 


M 


52 


sec 


2a 


1 


0 


IIA 


Poor-moderate 


4x3.5x2.5 


No 


Current 


280 


LCT-160 


M 


41 


ADC 


2a 


1 


0 


IIA 


Moderate 


4x3.5x4 


Yes 


Never 




LCT-1 71 


F 


69 


ADC 


2a 


0 


0 


IB 


Poor-moderate 


4x4x3 


No 


Never 





^blood sample of lung cancer; "^tissue sample of lung cancer and matched normal lung tissue. 
ADC, adenocarcinoma; SCC, squamous cell carcinoma. 
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Table 4 Clinicopathological features of these 200 
sporadic lung cancer tissues 



Lung cancer 



Characteristics 


N(%) 


Age in years: median 


57.93 ± 10.480 


Gender 




Female 


56(28.00) 


Male 


144(72.00) 


Smoking status 




Never 


92(46.00) 


Ever 


108(54.00) 


Pack years (median, range) 


741.94(10-3000) 


Histological type 




adenocarcinoma 


95(47.50) 


squamous cell carcinoma 


93(46.50) 


adenosquamous carcinoma 


10(5.00) 


others 


2(1.00)' 


Clinical stage^ 




1 


54(27.27) 


II 


56(28.28) 


III 


70(35.35) 


IV 


18(9.10) 


Tumor differentiation^ 




Poor 


92(46.47) 


Moderate and poor-moderate 


100(50.51) 


Well and moderate-well 


6(3.02) 


Lymph node metastasis 




Negative 


98(49.00) 


Positive 


102(51.00) 



^including: 2 neuroendocrine carcinoma, 
"^for non-small cell lung cancer. 



in an R379C amino acid change. All of these 7 patients had 
stage I and II non-small cell lung cancer (i.e., 5 adenocar- 
cinomas and 2 SCCs) without metastasis. However, it 
remains unknown what function of CRTC2 protein plays 
in lung cancer and further studies are warranted. 

Conclusions 

In summary, our current study identified and confirmed 
novel mutations of PROM 1 and CRTC2mNSCLC patients. 
Since the numbers affected patients are too small, we don't 
know whether these mutations can affect survival of pa- 
tients. Further studies will investigate the role of these two 
proteins in lung cancer development and progression. 

Methods 

Study population 

This study included a four-generation Chinese Han fam- 
ily with lung cancer cases that had 13(6 living) family 



members (Figure 2; Table 1). The proband (II-4) was 
a 65-year-old man who had been diagnosed with 
adenocarcinoma of the lung in 2012 with a 10-year 
history of chronic obstructive pulmonary disease (COPD) 
and bronchiectasis. He had smoked for the past 38 years 
with approximately 1095 pack per year. Biochemical 
examination showed an increased CEA (8.97 ng/mL, 
normal < 5 ng/mL), while CT scanning showed a single 
nodule with 1.5 x 1.5 x 1.0 cm in the apicoposterior seg- 
ment of the left upper lobe. This probands 51 -year-old 
nephew (III-l) presented with cough, hemoptysis and 
dyspnea. CT scanning confirmed unilateral mass in the 
superior segment of the right lower lobe. In 2009, he 
underwent an open lung lobectomy and the result of 
histopathological examination confirmed as lung adeno- 
carcinoma. In this family, other members also died of 
lung cancer (II-2), hepatocarcinoma (1-2, II-3, III-2), or 
gastric cancer (II-5). 

Furthermore, to confirm gene mutations in lung can- 
cer of this family members, we recruited 343sporadic 
lung cancer patients vs. 280 healthy controls as well as 
200 pairs of lung cancer and the corresponding nor- 
mal tissues from The West China Hospital of Sichuan 
University, The Second People s Hospital of Sichuan, and 
The Seventh Peoples Hospital of Chengdu, Chengdu, 
China between March 2010 and June 2012. All of the 
sporadic lung cancer patients and the healthy controls 
were non-related and were of Chinese Han ethnicity 
(Table 2). Those with secondary lung cancer or other ser- 
ious disease were intentionally excluded. The diagnosis 
was confirmed by histopathological examination of the 
resected or biopsy tissue specimens in all cases. The demo- 
graphic and clinical information were collected, including 
sex, age at admission, tobacco smoking, histological diag- 
nosis, tumor-node-metastasis clinical stage according to 
the American joint committee on cancer 2010 guidelines 
[38,39]. The study was approved by the Ethics Committee 
of West China Hospital of Sichuan University and all par- 
ticipants provided informed consent. 

DNA samples 

Peripheral blood collected from patients and controls 
into an anticoagulation tube and stored at -80°C until 
use. Genomic DNA was then extracted using a commercial 
DNA isolation kit from Bioteke (Beijing, China) according 
to the manufacturers instructions. Tumor tissues were 
fixed in formalin and embedded in paraffin and then sec- 
tioned for hematoxylin and eosin staining. Pathologists 
then identified areas of adequate tumor lesion (>70%) for 
DNA extraction. The normal lung tissue of the patients 
obtained during surgical resection represented lung tissue 
located more than 5 cm from tumor lesions. Genomic 
DNA was extracted using the Tissue Kit (Qiagen, Valencia, 
CA) according to the manufacturer s protocol. 
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III 



IV 




Figure 2 Pedigree of thiis lung cancer family. Hepatocarcinoma occurred in 1-2, 11-3, and III-2; lung cancer in 11-2, 11-4, and lll-l; and gastric 
cancer in 11-5. 



Whole genome exome capture and massively parallel 
DNA sequencing 

We then performed a whole genome exome sequencing 
to identify the disease-causing genetic variant for this 
lung cancer family. Briefly, 15 (ig of genomic DNA sam- 
ples from each of these four individuals (II-4, III-l, III-4, 
and IV- 1) were separately sheared into approximately 
200 to 300 bp DNA fragments by Covaris, and adapters 
were then ligated to both ends of the resulting DNA 
fragments and amplified by ligation-mediated PGR (LM- 
PCR), purified, and hybridized to the NimleGen 2.1 M 
human exome array for enrichment, non-hybridized 
fragments were then washed out. Both non-capture and 
capture LM-PCR products were subjected to quantitative 
PGR to estimate the magnitude of enrichment. Each cap- 
ture library was then loaded on Hiseq2000 platform, and 
high-throughput sequencing was performed for each cap- 
tured library independently to ensure that each sample at 
least 50-fold coverage. Raw image files were processed by 
Illumina base calling Software 1.7 for base calling with de- 
fault parameters and the sequences of each individual 
were generated as 90 bp paired-end reads. 

Sequencing reads were aligned to the human genome 
(hgl9) as the reference genome sequence, together with its 
gene annotation that was downloaded from the UCSC 
database (http://genome.ucsc.edu/). Single nucleotide poly- 
morphisms (SNPs) were identified by SOAPsnp [40] and 
small insertion/deletions (InDels) were detected by SAM 
tools [41]. SNPs and indels were defined based on satisfac- 
tion of the following: i) The consensus quality score 
was bigger than Q20 (the quality score is a Phred score, 
generated by the program SOAPsnp 12, quality score 20 



represents 99% accuracy of a base call); ii) The depth 
of supported reads at that locus was no less than 4X 
and no more than lOOOX; and iii)The distance be- 
tween two SNPs no less than 5. The changes that were 
shared in the two affected individuals, but absent in 
the two unaffected individuals were obtained by fur- 
ther comparison of the variants of each of the four 
individuals. All changes were filtered against the Han 
Chinese Beijing SNPs in the dbSNP132, and the 1000 
Genome Project (February 28, 2011 releases for SNPs and 
February 16, 2011 releases for indel http://www.1000 
genome.org). All variants were then confirmed by direct 
DNA sequencing of polymerase chain reaction (PGR) 
product (BigDye® Terminator v3.1 Gycle Sequencing Kits; 
Applied Biosystems, Foster Gity, GA). 

Validation of gene mutations 

To further verify the genetic variants identified in these 
two lung cancer patients sporadic lung cancer patients 
and the healthy controls, we performed PGR according 
to the manufacturer s instructions. Primers were synthe- 
sized by using the on-line software (http://frodo.wi.mit. 
edu/primer3/) for PGR amplification of variants identi- 
fied via exome sequencing. The primers of PROMl mu- 
tation were 5'-GAGGGGAGGGTAGTTTTGAG-3' and 
5'-GTTGGAGTGTGTGGGTGTGA-3' and the primers 
of CRTC2 mutation were5'-GAGGAGGAAGAGGAG 
GAGGA-3' and 5'-GTAAGGAATGGGAAGGTGGA-3'. 
PGR products were then digested overnight with specific 
restriction enzyme Mspl and separated by a 6% poly- 
acrylamide gel electrophoresis and stained with 1.5 g/1 of 
argent nitrate to visualize both PROMl T/G and CRTC2 
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G/A mutations. The genotypes and all mutations were 
confirmed by the DNA sequencing analysis (BigDye® 
Terminator v3.1 Cycle Sequencing Kits; Applied Biosys- 
tems, Foster City, CA). Approximately 10% of the cases 
were randomly selected for the repeated assays and the 
results were 100% concordant. 
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